FUNCTIONS OF RANDOM WALKS ON HYPERPLANE 

ARRANGEMENTS 



CHRISTOS A. ATHANASIADIS AND PERSI DIACONIS 

Abstract. Many seemingly disparate Markov chains are unified when viewed as random 
walks on the set of chambers of a hyperplane arrangement. These include the Tsetlin 
library of theoretical computer science and various shuffling schemes. If only selected 
features of the chains are of interest, then the mixing times may change. We study the 
behavior of hyperplane walks, viewed on a sub-arrangement of a hyperplane arrangement. 
These include many new examples, for instance a random walk on the set of acyclic 
orientations of a graph. All such walks can be treated in a uniform fashion, yielding 
diagonalizable matrices with known eigenvalues, stationary distribution and good rates of 
convergence to stationarity. 



I. Introduction 

Many seemingly disparate Markov chains may be successfully studied by viewing them 
as random walks on the set of chambers of a hyperplane arrangement [9]. These include 
the Tsetlin library of theoretical computer science, a variety of walks on the hypercube and 
various shuffling schemes [13] . If only selected features of such a Markov chain are of interest 
(for instance, only a few sites on the hypercube or the relative ordering of the top few cards), 
then the mixing time may change. Following a suggestion of Uyemura Reyes [IB], we 
study the behavior of hyperplane walks, viewed on subarrangements of a given hyperplane 
arrangement. This leads to new Markov chains which permit a full analysis. The following 
two examples illustrate our results and are used as running examples throughout. 

Example 1.1 (Conquering Territory). Consider an m x n grid, with each node labeled 
with +1 or —1. At each stage, a node is chosen from a fixed probability distribution, then a 
neighborhood of this node is chosen and finally, all labels of the nodes in this neighborhood 
are changed to +1 or all are changed to —1, according to a specific distribution. As ex- 
plained in Section [4], this Markov chain can be viewed as a hyperplane walk on the Boolean 
arrangement. Such walks were first studied in [HI [32] and include the classical Ehrenfest 
urn. The stationary distribution depends on the various probabilistic specifications but 
the theory of [9j [13] , reviewed in Section [2j gives a useful description of this distribution, 
as well as of the eigenvalues and rates of convergence to stationarity. 
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Suppose now that only the labels of a few nodes (for instance, the four corners or the 
middle node) are of interest. Common sense suggests that the induced process on these 
nodes may converge to stationarity at a faster rate than the entire chain. For example, in 
the Ehrenfest urn with n particles, order of n log n steps are required to equilibriate on the 
full state space but order of n steps suffice for a few tagged particles. Further details and 
examples appear in the sequel. □ 



Sometimes the induced chain is the object of direct interest, with the original chain 
opaque in the background. This is the case in our second example. 



Example 1.2 (Acyclic Orientations). Let Q be a simple undirected graph. A Markov 
chain on the set of acyclic orientations of Q can be defined as follows: At each stage, a 
node v of Q is chosen from a fixed probability distribution w and all edges of Q incident 
to v are oriented inward, towards v. Under some mild assumptions on Q and w, this is an 
ergodic Markov chain on the set of acyclic orientations of Q with describable stationary 
distribution and eigenvalues and with good control on rates of convergence. It arises as the 
chain induced from the Tsetlin library on the braid arrangement, where the subarrangement 
is the graphical arrangement corresponding to Q. It also arises as a walk on the Boolean 
arrangement; see Section [4] for a detailed discussion. □ 



This paper is organized as follows. Section [2] includes background on hyperplane walks 
and functions of a Markov chain, along with an overview of the basic examples of hyperplane 
walks on the Boolean and braid arrangements. Our main results appear in Section [3j 
Developing a suggestion in [46], the process induced from a hyperplane walk on the set 
of chambers of a subarrangement is considered. Although a function of a Markov chain 
is usually not Markov, it is shown that subarrangement processes are Markov chains. 
Moreover, the subarrangement chains are shown to be hyperplane walks in their own right. 
This implies that the whole tool kit of results for hyperplane walks [HI [13] is available. One 
striking feature of general hyperplane walks is that they have nonnegative real eigenvalues, 
although these chains are almost never symmetric or reversible. A purely combinatorial 
proof of this fact, as well as a new coupling proof of the basic theorem of [13], giving rates 
of convergence to stationarity, also appear in Section [3j 

Sections[4]and[5]give applications of the general theory to the hyperplane walks of Section 
|2j treating Examples 1.1 and 1.2 in several variations. These include a variety of functions 
on the Tsetlin library and inverse a-shuffling Markov chains on the symmetric group, such 
as those assigning to a permutation the set of elements preceding a given entry in its linear 
representation, the descent set and the cyclic descent set. Most of the induced chains we 
study seem very different from their parents. As a byproduct of our considerations, we 
mention an interpretation for the unsigned coefficients of the chromatic polynomial of a 
graph as the multiplicities of the transition matrix of a certain natural Markov chain on the 
set of acyclic orientations of Q (see Proposition 5.11). Section [6] briefly discusses extensions 
to random walks on semigroups. 
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2. Background 

This section reviews the main results on hyperplane walks, develops needed examples 
(hypercube walks, Tsetlin library, inverse a-shuffles) and provides background on functions 
of a Markov chain. 



2.1. Hyperplane walks. Let A be a hyperplane arrangement in V = M n , meaning a finite 
set of affine hyperplanes in V. The intersection poset of A is the set = {fl £ : £ C A}, 
consisting of all affine subspaces of V which can be written as intersections of some of the 
hyperplanes of A, partially ordered by reverse inclusion. The space V, corresponding to 
£ = 0, is the minimum element of C^. 

The connected components of the space obtained from V by removing the union of the 
hyperplanes of A are called chambers. The restriction of A on an intersection subspace 
W G C_x is the hyperplane arrangement in W consisting of the intersections of W with 
the hyperplanes of A which are not parallel to (in particular, do not contain) W. The 
chambers of all such restricted arrangements are called faces of A. Thus the chambers of 
A are exactly its faces of dimension n. We will denote by C4 the set of chambers of A and 
by Ta its set of faces. The elements of Fa are the open cells of a regular cell decomposition 



of V; see [TOl Section 2.1]. Figure 2.1 shows an arrangement of four hyperplanes (lines) in 
IR 2 which has ten chambers, thirteen one-dimensional faces (open line segments) and four 
zero-dimensional faces (points). 




Given faces F,Gg Fa, we say that F is a face of G if F is contained in the closure of G, 
with respect to the standard topology on V. Given a chamber C G C4 and a face F G Fa, 
a lemma of Tits |45j asserts that among all chambers of A which have F as a face, there 
is a unique chamber C' which is closest to C, in the sense that the set of hyperplanes in A 
separating C' from C is minimum with respect to inclusion. We will refer to this chamber 
C' as the projection of C on F and will denote it by FC. For an alternative definition, pick 
any points x G F and y G C and move by a small distance away from x in the direction of 
y, in the line segment joining these two points. The resulting point lies in a well defined 
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face of A, which is the face FC. An example is given in Figure 2.1 The second recipe can 
be used to define the face FG G Ta for any two faces F,G G Ta- 

Using these ingredients, Bidigare, Hanlon and Rockmore [9] suggested the following 
family of Markov chains on the state space Ca- Start with a probability measure w on 
J- A- A step in the chain is given by choosing a face F G J- a from w and moving from the 
current chamber C G Ca to FC. Equivalently, we can describe this chain by defining its 
transition matrix K by the equation 

(2.1) K(C,C) = W ( F )- 

FeT A :FC=C 

Henceforth, we will refer to this Markov chain as the hyperplane walk on A (or on Cj) 
associated to w. 

Hyperplane walks being so general, it is surprising that there is a relatively complete 
theory for them. We recall that the total variation distance between two probability 
distributions P and Q on a finite set Q is defined as 

ll-P — QHtv = max \P(A) -Q{A)\. 

ACQ 



The measure w on Ta is said to be separating [13] if for every H G A there exists a face 
F G Ta such that F (I H and w(F) > 0. We combine the main results of [9], [13] into the 
following theorem. 

Theorem 2.1. Let A be a hyperplane arrangement in V with set of faces Ta and inter- 
section poset Cj, and let w be a probability measure on Ta- 

(i) The characteristic polynomial of K is given by 

det(xI-K) = Y[ (x-\ w ) mw , 

WdC A 

where 

(2.2) \ w = w ^ 

F£T A :FQW 

is an eigenvalue, 

m w = \fi A (y,W)\ = (-ir dha ^ v ^ A (v,w), 

Ha is the Mobius function of Ca and codim(H / , V) is the codimension ofW in V . 

(ii) The matrix K is diagonalizable. 

(iii) K has a unique stationary distribution it if and only if w is separating. 

(iv) Assume thatw is separating and let K l c be the distribution of the chain started from 
the chamber C after I steps. Then its total variation distance from tc satisfies 

(2.3) II^-ttHtv < P{F 1 F 2 ---F l ^C A }, 
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where (F x , F 2 , . . . ) consists of independent and identically distributed picks from the 
measure w on Ta, and 

(2.4) IM^-ttUtv < E X h- 

HeA 

Furthermore, there is a useful description of the stationary distribution which will not 
be detailed here; see [131 Theorem 2 (b)]. As noticed in [T3J Section 4], the right-hand side 



of (2.3) is bounded from above by that of (2.4). The bounds in (2.3) and (2.4) are usually 



good but not perfect. 

2.2. Examples. Our main examples involve the Boolean and braid arrangements. In 
what follows, we denote by & n the symmetric group of permutations of the set [n] := 
{1,2,..., n}. We will often use the one line notation (r(l), r(2), . . . , r(n)) for a permutation 
r G & n . It will be helpful to think of r as a linear ordering of a deck of n cards, labeled 
bijectively by the elements of [n]. 

A. The Boolean arrangement. Let A = {Hi, H 2 , . . . , H n } be the Boolean arrangement 
in V = W 1 , where Hi is the coordinate hyperplane in R n defined by the equation X{ = 0, 
for 1 < i < n. The intersection poset Ca is isomorphic to the lattice of subsets of the 
set [n], ordered by inclusion, where the isomorphism maps a subspace W G £4 to the 
set of indices i G [n] for which W C Hi. The Mobius function ha of this poset satisfies 
^ A (V, W) = (_i)«*Hm(w;v) for w e Ca 

The set Ta of faces of A is in bijection with the set { — ,0,+} n of all 3 n possible sign 
vectors of length n (where the bijection is induced by the map assigning to each point 
x G W 1 the sequence of signs of the coordinates of x). The chambers of A are the orthants 
in M. n ; they correspond to the 2 n elements of {—,+}"• Using these bijections, we may 
identify Ca and Ta with the sets {—,+}" and { — ,0,+} n , respectively (the former may 
also be identified with the set of vertices of the n-dimensional cube [—1, l] n ). The projection 
FC of a chamber C G Ca on a face F G Ta is the chamber whose sign vector is obtained 
from that of F by switching every zero coordinate to the corresponding coordinate of C. 
Thus, given a probability measure w on Ta, the hyperplane walk on A associated to w 
proceeds from the current chamber C G Ca by selecting a face F G Ta with probability 
w(F) and replacing the ith coordinate of C by the ith coordinate of F, whenever the latter 
is nonzero, to reach the next chamber in the walk. Some examples of these walks appear 
in [9j Section 2.3] and [131 Section 3B]. 



It follows from Theorem 2.1 that the transition matrix K for this chain has eigenvalues 



(2.5) X s = W (F), 

one for each S C [n], where Ts denotes the set of faces of A whose sign vectors have 
their ith coordinate equal to zero for every i 6 S, and that K has a unique stationary 
distribution tt if and only if for every 1 < % < n there exists a face F G Ta with nonzero 
ith coordinate, such that w(F) > 0. 
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The eigenvalues which correspond to the hyperplanes Hi and which appear in the right- 
hand side of (2.4), are the numbers X^y for 1 < i < n. 

Example 2.2. For e G { — , +} and 1 < i < n, we denote by Ef the face of A whose sign 
vector has its ith coordinate equal to e and all other coordinates equal to zero. We choose 
face weights so that w(Ef) = wf for e G {— , +} and 1 < i < n, and w (F) = for all other 
faces F G Ta, where the wf are nonnegative real numbers summing to 1. The resulting 
Markov chain is the nearest neighbor random walk on the vertex set { — ,+}" of the n- 
dimensional cube, which evolves by picking a coordinate i, at each stage, and switching it 
to e, with probability w\. Thus, the transition matrix K for this chain is given by 

if x' = x, 

if x' is obtained from x by switching the ith coordinate to — £j, 
otherwise 

+}" with x = (£1,82, ■ ■ ■ ,e n ). It has eigenvalues 

As = Wi > 

i£[n]\S 

one for each S C [n], where Wi = w~ + wf. Moreover, K has a unique stationary distri- 
bution 7i if and only if Wi > holds for every 1 < i < n. In that case, tc is given by the 
formula 




(2.6) tt(x) = H 



Wi 
1=1 



for x = (si,£2, ■ ■ ■ ,£n) G { — , +} n (see, for instance, [131 Section 3B]) and (2.4) gives the 
bound 



(2.7) \\K ~ kWtv < E( X 



Wi) 1 , 



8=1 



where K l x is the distribution of the chain started from x after / steps. □ 



B. The braid arrangement. Let A be the braid arrangement in IR n , consisting of the 
(2) hyperplanes defined by the equations Xi — Xj = for 1 < i < j < n. The intersection 
poset £_4 is isomorphic to the lattice of partitions of the set [n], ordered by refinement. 
The isomorphism maps a subspace W G to the partition of [n] in which i and j are in 
the same block if and only if Xi = Xj holds for every point (xi, 22, • • • , x n ) G W . 

The set C4 of chambers of A is in bijection with & n and the set of faces is in bijection 
with the set of ordered partitions of [n], meaning set partitions (B\, B 2 , . . . , Bf.) of [n] in 
which the order of the blocks matters. To be specific, let us agree that the permutation 
t G & n corresponds to the chamber 



X T (i) > X T ( 2 ) > • • • > x r(n) . 
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More generally, the ordered partition B = (Bi, B 2 , . . . , B^) of [n] corresponds to the face 
of A defined by the equalities Xi = Xj, whenever i and j lie in the same block of B, and 
the inequalities Xi > Xj, whenever the block of B which contains i has smaller index than 
the block of B which contains j. 

The action of faces on chambers can be described as follows. Let C G C4 be the 
chamber corresponding to r G & n and F G JFa be the face corresponding to the or- 
dered partition B of [n]. One can then check that FC is the chamber which corre- 
sponds to the permutation of [n] which is obtained from B by listing the elements of 
each block of B in the relative order in which they appear in r. For instance, if n = 9, 
r = (8, 1, 4, 9, 7, 2, 6, 3, 5) and B = ({6, 9}, {1,3, 7}, {4}, {2, 5, 8}), then the resulting per- 
mutation is equal to (9,6,1,7,3,4,8,2,5). In the sequel, we identify faces (respectively, 
chambers) of A with the corresponding ordered partitions (respectively, permutations) of 
the set [n] . 

Different choices of probability measure on J-"_4 lead to various interesting Markov chains 
on & n . We concentrate on the following two examples. 

Example 2.3 (Tsetlin Library). Let w\, W2, ■ ■ ■ , w n be nonnegative real numbers summing 
to 1. Choose face weights so that 



for an ordered partition B of [n]. The projection of r on B = ({«}, [n] \ {z}) removes the 
entry % in the one line notation of the permutation r and places it in front. Hence the 
transition matrix K is the n\ x n! matrix defined by 



This chain has been extensively studied as a model of dynamic storage allocation; see [20] 
for a survey and [7] for recent results. It was shown by Phatarfod [32], and follows easily 
from Theorem |2. 1| (see [UEI2]), that K is diagonalizable with eigenvalues 



one for each r G & n . Moreover, K has a unique stationary distribution ir if and only if 
we have to, = for at most one index i. In that case, tt is given by "sampling the weights 
without replacement to generate a random permutation" . Thus we have 




Wi, if B = ({«}, [n] \ {i}) for some 1 < % < n 
0, otherwise 




Wi, if t' is obtained from r by moving i in front, for some i 
0, otherwise. 



(2.8) 




(2.9) 



W T (i)W T (2) ■ ■ ■ UV(n-l) 



(1 - UV(1))(1 - Wr(l) - W t{2 )) •••(!- U> T (i) 




for r G & n . This stationary distribution is known as the Luce model in cognitive psychol- 
ogy; see [HI p. 174] for extensive references. Equation (2.4) of Theorem 2.1 (iv) gives the 
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bound 

(2.10) \\K l T -7r\\ TY < J2 

l<i<j<n 

on the rate of convergence to stationarity, where K T is the distribution of the chain started 
at r after I steps. Suppose, for instance, that Wi = 1/n for 1 < i < n, so that at each 
stage of the chain, an entry of the current permutation is selected uniformly at random 
and moved in front (thus the chain evolves by the "random to top" rule). Then we have 

(2.11) \\K-^< g) 

The expression on the right is bounded above by e _2c /2 if I > n(logn + c), for c > 0. 
In this case there is a matching lower bound which shows that n(logn + c) steps are in 
fact necessary and sufficient for convergence to stationarity; see [16] for further details and 
more refined asymptotics. □ 

Example 2.4 (Inverse a-shuffles). Ordinary riffle shuffles have received a careful analysis 
in [8]. A key to this analysis is a natural model on inverse riffle shuffles. Informally, 
begin with a deck of cards in order. Label the back of each card by one of the numbers 
in {1,2, ... ,a}, choosing the labels uniformly and independently. Then remove all cards 
labeled 1, keeping them in the same relative order, and start a new deck. Remove the 
cards labeled 2, keeping them in the same relative order, and place them below the ones 
labeled 1. Continue, placing the cards labeled a at the bottom. This can be seen as a 
random walk on the braid arrangement. 

More formally, let a > 2 be an integer and A be the braid arrangement in M. n , as before. 
A weak ordered partition of [n] is a sequence of pairwise disjoint sets (called blocks) whose 
union is equal to [n]. From such a sequence one gets an ordered partition of [n] by simply 
removing the empty blocks. We define a probability measure w on by first assigning 
weight equal to l/a n to each of the a n weak ordered partitions (Bi, B 2 , . . . , B a ) of [n] with 
a blocks and then letting w(B) equal the sum of the weights of all weak ordered partitions 
of [n] with a blocks which correspond to the ordered partition B. For instance, if a = 2, 
then 

{l/2 n ~\ if B = (M) 
l/2 n , if B = (s, [n] \ s) and s ^ 0, s ^ [n] 
0, otherwise 

for an ordered partition B of [n]. 

The resulting chain on & n proceeds from a given permutation r by selecting uniformly 
at random a weak ordered partition [B\,B2, . . . ,B a ) of [n] with a blocks and listing the 
elements of each block Bj in the relative order in which they appear in r, to reach a new 
permutation r' (such a permutation is said to be obtained from r by an inverse a-shuffie). 
Equivalently, the transition matrix K of the chain satisfies 
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where u(r, r') is the number of weak ordered partitions of [n] with a blocks, the projection 
of t on which is equal to r'. 

Let W G £4 be an intersection subspace of codimension k = codim(W, V) and let cr be 
the corresponding partition of [n], so that the number of blocks of a is equal to n — k. 



Then the right-hand side of (2.2) is equal to the probability that the following holds for a 
random weak ordered partition B of [n] with a blocks: for every pair {i,j} of elements of 
[n] belonging to the same block of a, the elements % and j also belong to the same block 
of B. This probability is clearly equal to l/a h and hence 

(2.12) X w = i/a^dimflW 



Thus it follows easily from Theorem 2.1 (i) (see [9j Equation (31)]) that the distinct eigen- 
values of K are 1, 1/a, 1/a 2 , . . . , l/a n_1 and that the multiplicity of the eigenvalue 1/a 1 is 
equal to the number of permutations in & n which have exactly n — i cycles. The stationary 



distribution ix in this case is the uniform distribution on & n and (|2.4|) gives the bound 

K-*v< g)g 

The expression on the right is bounded above by a~ c /2 if I > 21og a n + c and c > 0. In 
fact (3/2) log a n + c steps are necessary and sufficient for convergence to uniformity; see [8 J 
for further details and asymptotics. 

The chain of inverse a-shuffles converges to the uniform distribution at precisely the 
same rate as the chain of ordinary riffle shuffles on & n . Thinking of the elements of 6„ as 
linear orderings of a deck of n cards, this chain proceeds from a given ordering as follows. 
The deck is cut into a (possibly empty) packets according to the multinomial distribution 
on their sizes. Then all a packets are riffled together, each time dropping a card from one 
of the a packets with probability proportional to its size, to get to a new ordering of the 
deck. For more information and extensive discussions, see [8j[IIl|30]. □ 



Examples 2.3 and 2.4 are two of the most interesting cases of general hyperplane walks. 
Other hyperplane arrangements for which the chambers are indexed by familiar combina- 
torial objects and the associated Markov chain has a reasonably down to earth description 
appear in [T3l Section 3]. Further examples where the probabilistic analysis remains to be 
done can be found in [5], [33j [39] and [H] Lecture 5]. 

2.3. Functions of a Markov chain. Let X , Xi, X 2 , . . . be the successive outcomes of 
a Markov chain on a finite state space X. Consider a finite set 2) and a surjective map 
/ : X — y 2). We may think of 2) as a set partition of X and of the map / as the canonical 
surjection. Thus f(x) is equal to the unique block of 2) which contains x, for every x e X. 
We set Yi = f(Xi) for each index % and refer to (Yj) as the stochastic process (or chain) on 
the state space 2) which is induced from (Xi) by the map /. 

A function of a Markov chain is usually not Markov. The following lemma gives a 
necessary and sufficient condition for Markovianity in the situation described above. We 
refer the reader to [26l Sections 6.3-6.4] for a good elementary treatment. For a more 
sophisticated treatment and references, see 



10 



CHRISTOS A. ATHANASIADIS AND PERSI DIACONIS 



Lemma 2.5 (Dynkin's Criterion). Let (JQ)> be a Markov chain on a finite state space X 
and let 2) be a partition of X. The chain induced by the canonical surjection f : X — >■ 2J 
is Markov for all starting distributions for Xq if and only if for any two distinct blocks 
B , B\ G 2), the probability P(X\ G B\ \ X = x ) is constant in xo G -Bo- 
It is known that if the chain (Xi) is ergodic with stationary distribution tt, then the 
induced chain (Yj) has a limiting stationary distribution tt, given by 

(2.13) 7f(B) = J2 < x ) 

and one may inquire about rates of convergence to stationarity (even if the induced chain 
is not Markov). There has been considerable work on convergence rates in the situation 



of Example |2.3 (see [7J [21] ) and in that of riffle shuffling (see [15] for a survey and [3] for 



some recent developments and references). Further work appears in Sections [4] and [5] 

3. Main results 

This section contains our main theoretical contribution. Following a suggestion of Uye- 
mura Reyes [16] , we show that the process which is induced from a hyperplane walk on the 
set of chambers of a subarrangement is a Markov chain which is itself a hyperplane walk, 



with transition matrix easily computable in terms of the original walk (Corollary 3.2). We 



also give a new proof of the description of the eigenvalues of hyperplane walks (part (i) of 



Theorem 2.1), which uses only basic enumerative combinatorics, and a new proof of the 



basic convergence theorem (part (iv) of Theorem 2.1), which is perhaps more transparent 
than the one given in [15] . 

Throughout this section, A is a hyperplane arrangement in the vector space V = R n 
with set of chambers C4 and set of faces J 7 ^, B C A is a subarrangement with set of 
chambers and set of faces J-jg and K is the transition matrix of the hyperplane walk on 
A associated to a given probability measure w on T_\. Our starting point is the observation 
that every chamber C G C4 is contained in a unique chamber of B, which we denote by 
C. Moreover, every chamber of B contains at least one chamber of A. Thus there is a 
surjective map / : C4 — > C& defined by f{C) = C for C G C4 and hence the hyperplane 
walk on A associated to w induces a stochastic process on the state space Cg, in the sense 



of Section 2.3 The following proposition verifies Dynkin's criterion in this situation. 



Proposition 3.1. Let D,D' G C& be chambers. If C G is any chamber with C = D, 
then the sum 

(3.1) Q(C, D') = J2 K(C,C) 

C'dC A : c T =D' 

depends only on D and D' and not on the choice of C . 



Proof. Replacing K(C,C) by the right-hand side of (2.1), we find that 
(3.2) Q(C,D f ) = E w ^ = E 

C'eC A : C 7 =D' FeT A :FC=C F&F A :F~C=D' 
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Let us denote by F the unique face of B which contains F G T a- It is easy to check that 
FC = F C holds for every F G Ta- This observation and (3.2) imply that 

(3.3) Q(C,D') = 

F&F A :FD=D' 



Clearly, the right-hand side of (3.3) is independent of the choice of C. □ 

Corollary 3.2. For every starting distribution on Ca, the stochastic process induced on 
Cb from the hyperplane walk on A associated to w is Markov. Moreover, such an induced 
chain is itself a hyperplane walk on B, with associated probability measure w* on J-g defined 
by 

(3.4) w*(G) = V w(F) 



FeF A :FCG 



forGe F B - 



Proof. The first statement follows from Proposition 3.1 and Lemma 2.5 The transition 
matrix K* of the induced Markov chain on C& is given by the right-hand side of (3.1), so 
that 

(3.5) K*{D,D') = K(C,C') 

C'£C A : CCD' 

holds for D,D' G Cb, where C G Ca is any of the chambers of A contained in D. Finally, 
we note that (3.3) can be rewritten as 



(3.6) K*(D,D') = ^ w*(G), 

GeJ r B .GD=D' 



where w*(G) is as in (3.4). This proves the second statement in the corollary. □ 

The next statement summarizes the main conclusions of our discussion. 

Theorem 3.3. Let A be a hyperplane arrangement in V with set of chambers Ca and let 
w be a probability measure on its set of faces Ta- Let B C A be a subarrangement with 
set of chambers Cb and set of faces J-g and let K* be the transition matrix of the Markov 
chain on B induced from the hyperplane walk on A associated to w. 

(i) The characteristic polynomial of K* is given by 

det{xI-K*) = Y[ (x-\ w ) m w, 

W&C B 

where Cb is the intersection poset of B, \w is as in (2.<fy , 

m* w = \hb(V,W)\ = (-l) codimWy W^W0 

and fiB is the Mobius function of Cb- 

(ii) The matrix K* is diagonalizable. 



12 



CHRISTOS A. ATHANASIADIS AND PERSI DIACONIS 



111 



K* has a unique stationary distribution tt if and only if for every H G B there exists 
a face F G Tj± such that F <£. H and w(F) > 0. Moreover, if w is separating, so 
that the stationary distribution tt of the hyperplane walk on A also exists, then we 
have 



(3.7) 



TT 



(D) 



E 

CeC A :CCD 



71 



(C) 



IV, 



for every chamber D aCg. 

Assume that tt exists and let {K* D ) 1 be the distribution of the induced chain started 
from the chamber D G Cg after I steps. Then its total variation distance from tt 
satisfies 



(3.8) 



||(i^)'-7f|| TV < P [F 1 F 2 ...F l C |J H 



where (F 1; F 2 , . . . ) consists of independent and identically distributed picks from the 
measure w on Tj±, and 



(3.9) 



II (K 



DJ 



7r||TV < ^ 



A 



H ■ 



Proof. Let w* be as in Corollary |3.2 By Corollary 3J2 and Theorem 21, the characteristic 
polynomial of K* is given by the expression suggested in part (i), provided that is 
replaced by 

(3.10) \* w = 

GeTB'-Gcw 

for every W G Lb- Since every face G G J-jg is partitioned by the faces F G J- a contained 



in G, it follows from (3.4) that the right-hand sides of (2.2) and (3.10) coincide. Hence 
we have A^ = Xw for every W G Cb and part (i) follows. The remaining parts are direct 
consequences of the combination of Corollary 3.2 with Theorem 2.1 □ 



We now turn to our new proofs of parts (i) and (iv) of Theorem 2.1 The proof of part 
(i) is motivated by the combinatorial method used in [1] to determine the eigenvalues of 
some matrices, with rows and columns indexed by the oriented rooted spanning trees of a 
directed graph. A related argument which proves diagonalizability is given in [TT| Section 
8.1] jjI2J Section 3.4]. We denote by tr(A) the trace of a p x p matrix A = (ay), so that 



(3.11) 



W) = E E 



a%% x a 



l\l2 



i=l l<ii,...,ii_i<p 



holds for every positive integer I. The method of [1] is based on the following elementary 
lemma. 



Lemma 3.4. Let A 



be a p x p matrix with complex entries and let Ai, A2, 



A, 



be complex numbers. If tr(A l ) = X[ + Ar 



Ai,A 



2- 



are the eigenvalues of A. 



Ap holds for every positive integer I, then 
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Proof. We note that tr(A l ) = fi\ + /jl 2 + • • - + fJL l p holds for every positive integer I, where 
/xi,/X2, • • • ,fJ>p are the eigenvalues of A. It follows from this fact, our hypothesis and pfj 
Lemma 2.1] that the Aj are a permutation of the /ij. This proves the lemma. □ 



Proof of Theorem 2.1 (i). By Lemma [3 A\ it suffices to show that 

ti{K l ) = ™w(Xw) 1 
w&c A 

holds for every positive integer I. Using the definition of K, we see that for this matrix 



(3.11) can be rewritten as 

(3.12) tr(K l ) = E w(F l )w(F 2 )---w(F l ), 

where the inner sum ranges over all sequences (Fi, F 2 , . . . , Fi) of elements of F A of length 
I satisfying F\F 2 ■ ■ ■ FiC = C. Next we claim that for every F 6 T A we have 

(3.13) #{CeC A :FC = C} = J] W{V,W)\ = £ m w . 

W£C A : FCW W£C A : FCW 

Indeed, for a chamber C G C A we have FC — CM and only if F lies in the closure of C. The 
chambers of A with this property are in a one to one correspondence with the chambers 



of the subarrangement of A consisting of those hyperplanes which contain F. Thus (3.13) 



follows from Zaslavsky's formula [H] Theorem 2.5] (17] for the number of chambers of this 



subarrangement. Using equations (3.12) and (3.13) we find that 



tr (^) = E E E w{Fi)w(F 2 )...w{Fi) 

CeC A FdT A :FC=C F\Fi--Fi=F 

= #{CeC A :FC = C} w{F 1 )w{F 2 )-..w{F l ) 

F&F A F 1 F 2 -F l =F 

= E E mw E w(F 1 )w(F 2 ) ■■■w(F l ) 
FeT A wec A : fcw f 1 f 2 -f 1 =f 

= E mw E w(F 1 )w(F 2 )---w(F l ) 

wec A F 1 F 2 -F l CW 

= E mw E w(F 1 )w{F 2 )---w(F l ) 

WeC A FiU-UFiCW 



wec A \fcw J w&c A 



m w (X w ) 1 , 



as desired. 



□ 
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Theorem 2.1 (iv) is proved in [T3] by considering backward iteration. The following 
coupling proof is perhaps more transparent. For background on coupling we refer the 
reader to [HJ p. 84] [29l Chapter 5]. We recall that the probability measure w on the set 
of faces of A is assumed to be separating. As before, is the set of chambers of A. 



Proof of Theorem 2.1 (iv). Consider two Markov chains (Xj) and (Yi) evolving on as 
follows. The first chain starts at Xo = C and the second starts with Yq chosen from the 
stationary distribution tt. At time % the face Fi is chosen from w and is used to upgrade both 
chains; thus Xj = _FjXj_i and Yi = FjXi-i- Let T be the first time I that the hyperplanes 
of A have been separated by {F±, F 2 , . . . , Fi}, meaning that for every H G A there exists 
an index 1 < % < I such that Fi <2 H. We claim that at this time we have Xt = Yt- It 
is then clear that Xj = Yj has to hold for all j > T. Thus the two chains are coupled 



and (2.3) follows from the basic coupling inequality [HI p. 84] [2HJ Chapter 5]. Since (2.4) 



follows easily from (2.3) (see [HI p. 1839]), it remains to prove the claim. 

Consider any hyperplane H G A and choose an index 1 < i < T so that Fi % H . 
Then both chambers Xj = i<jXj_i and Yi = i^li-i lie in the same open half-space of V 
determined by H as Fi. Therefore these chambers lie in the same open half-space of V 
determined by if. It follows by induction on j that the same holds for Xj and Yj for all 
j > i and thus for j = T as well. We have shown that for every H G A, the chambers 
Xt and Y? lie in the same open half-space of V determined by H. Clearly any two such 
chambers must be equal. This proves the claim and completes the proof. □ 

Remark 3.5. As was the case in [T3], the argument in the previous proof does not require 
that faces are chosen independently from the same distribution. Any stationary process 
works as well. Nonstationary choices of face weights may be similarly handled. Then there 
may not be a stationary distribution and one needs to study "merging" [36] . 

4. Applications to hypercube walks 

Throughout this section, A stands for the Boolean arrangement in for some N, to 
be specified in each case. Specializing the choice of face weights and subarrangement gives 
a variety of natural examples. Part A treats the Ehrenfest urn of statistical mechanics. A 
spatial process driven by neighborhood attacks is studied in Part B. Part C gives a first 



treatment of the acyclic orientations chain (Example 1.2 in the introduction); the results 



are summarized in Corollary 4.1 



A. Ehrenfest Urn. Consider the Markov chain of Example 2.2 with weights wf = l/2n 
for all e G {— , +} and 1 < i < n. This is the usual nearest neighbor random walk on the n- 
dimensional cube with holding 1/2, also known as Ehrenfests' urn. The transition matrix 
K has eigenvalues j/n with multiplicity (™), for < j < n, and a uniform stationary 
distribution 7r. This walk has a small literature of its own, reviewed in [HI P- 19] [291 
Section 2.3]. As explained there, the mixing time is \n\ogn. The slightly less accurate 
bound 



\K - TT tv < n [ 1 

n 
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follows from (2.7) and shows that the total variation distance on the left is bounded above 
by e~ c if / > n(logn + c). To illustrate the speedup possible for a sub arrangement walk in 
this case, consider the subarrangement B = {Hi, H2, . . . , of the Boolean a rran gement 



A in M. n . The induced walk is a Markov chain on the set {—,+}• Theorem 3.3 implies 



that its transition matrix K* has eigenvalues (n—j)/n with multiplicity ( .), for < j < k 



and a uniform stationary distribution it. Equation (3.9) gives 

IIW-*Htv < k(i-- 

and hence the expression on the left is bounded above by e~ c if I > n(log k + c). 

B. Neighborhood Attacks. Let Q be a (finite, undirected) simple graph on the node set 
[n]. Each node of Q is labeled with either + or — . A Markov chain on the set { — , +} n of 
all 2 n possible labelings proceeds as follows. At each stage, a node of Q is chosen uniformly 
at random. The labels of this node and of its neighbors are all changed to + or all changed 
to — , with probability 1/2. The transition matrix K for this chain satisfies 



K[x, x) = 



2n 

for x,x' G { — ,+}", where fi(x,x') is the number of pairs (i,e) of nodes i G [n] and signs 
e G { — , +} for which x' is obtained from x by changing the labels of i and its neighbors in Q 
to e. Clearly, this is the chain defined by the hyperplane walk on the Boolean arrangement 
A in M. n for the following choice of face weights. For each node % G [n] and e G {— , +} we 
denote by Ff the face of A whose sign vector has j'-coordinate equal to e, if j is a neighbor 
of i in Q or j = i, and equal to otherwise. We define w(F) as l/2n times the number of 
pairs (i, e) of nodes i G [n] and signs e G { — , +} for which Ff = F (note that we may have 
Ff = F? for distinct nodes i,j G [n]). Ehrenfests' urn occurs as the special case in which 
Q has no edges. 

For S C [n] we denote by a(S) the number of nodes of Q which are neither equal nor 



adjacent to any of the nodes in S. It follows from (2.5) that K has eigenvalues j /n, with 
multiplicity equal to the number of subsets S C [n] with a (S) = j, for < j < n and 
that for 1 < i < n, the eigenvalue contributed by the hyperplane Hi of A is equal to 
1 — (di + 1)/ti, where di is the degree of node i in Q. The stationary distribution 7r for 



this example exists for every graph Q but is hard to compute in general. Inequality (2.3) 
bounds the total variation distance \\K l x — 7t||tv from above by the probability that 

1 

U * w. 

1=1 

where nodes v±, . . . , vi are picked independently and uniformly from [n] and N(v) stands 
for the set of nodes of Q which are either adjacent or equal to v. To compute this probability 
is a classical problem, called the "coverage problem"; see, for instance, [2JE1I2Z!- Similarly, 
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the eigenvalue bound (2.4) becomes 



(4.i) ii^-vtiitv < £ I 1 - dl + lXl 



n 



For instance, if Q is the complete graph on the node set [n] , then di — n — 1 for all i and 
the walk becomes random after exactly one step. 

The eigenvalue bound is not perfect. For instance, consider a "star graph", having one 



central node of degree n — 1, and n — 1 leaves of degree one. The right-hand side of (4.1) 
becomes (n — 1)(1 — 2/n) 1 and shows that order of n log n steps suffice. On the other hand, 
the coverage bound is bounded above by (1 — 1/n) 1 , which is the chance of missing the 
central node in the first I steps. This implies that order of n steps suffice. An elementary 



argument shows that this is the correct answer. For a general graph Q, (4.1) implies that 



\\K l x — 7T || tv < e~ c if / > ^-j-(logn + c), where d is the largest of the degrees di. 

To estimate the time it takes for a subset of nodes, say {1,2,..., A;}, to equilibriate, 
consider the subarrangement B = {Hi, H 2 , . . . , Hk} of A and note that ( 3.9| ) becomes 



iiw-hitv < t o-^y 



This offers a range of possibilities to illustrate the speedup possible; we leave further details 
and examples to the interested reader. One can also deduce easily from Theorem |3.3| that 
the transition matrix K* of the induced chain has eigenvalues a(S) / n, one for each S C [k]. 

The previous situation can be easily varied; the nodes can be chosen with varying prob- 
ability, the size and shape of the neighborhood can be allowed to fluctuate and the chance 
of + or — need not be symmetric. With such freedom, the stationary distribution becomes 
intractable but it is still staightforward to give upper bounds for the total variation dis- 
tance to stationarity. Lower bounds are harder to achieve, due to our lack of knowledge of 
the stationary distribution. 

C. Orientations. Let Q be a (finite, undirected) simple graph on the node set [n] with 
m edges. An orientation of Q is an assignment of a direction i — > j or j — > i to each edge 
{i, j} of Q. We will denote by 0(G) the set of all orientations of Q. This set is in bijection 
with { — , +} m and hence with the set of chambers of the Boolean arrangement A in R m . 
To be more specific, let Eg = {ei,e2, . . . ,e m } be the set of edges of Q, equipped with a 
fixed linear ordering of its elements, and let us identify an orientation o G 0(Q) with the 
sign vector (si, 62, ... , e m ) G { — , +} m for which 



— , if is directed as i — > j in and i < j 
+, if Ck is directed as i — » j in and i > j 

for 1 < k < m, where = {i,j}. Thus any hyperplane walk on A defines a Markov 



chain on O(^). A choice of face weights which gives Example 1.2 of the introduction is the 
following. Let Wi, W2, ■ ■ ■ , w n be nonnegative real numbers summing to 1. For 1 < i < n, we 
denote by Fj the face of A whose sign vector has kth coordinate equal to +, if = {i, j} 
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with i < j, to — , if Ck = with i > j and to 0, if is not incident to i. We let 

w(Fi) = Wi for each node % G [n] which is not isolated in Q and w(F) = for all other 
nonzero faces of A, and define w(F) as the sum of Wi over all isolated nodes i G [n] of G, 
if F is the zero face of A. 

The resulting chain on 0(G) proceeds from a given orientation by selecting the node i 
of G with probability Wi and reorienting all edges of Q incident to this node towards itself, 
to reach a new orientation of G, leaving the orientations of all other edges of Q unchanged. 
Equivalently, the transition matrix K of this chain on 0(G) satisfies 

(4.2) K{o,o') w * 

ie[n]: Pi{o)=o' 

for o, o' G 0(G), where Pi(o) denotes the orientation of Q obtained from o by reorienting 



towards i, as just described. We collect the consequences of Theorem 2T for this example 
in the following statement. We denote the stationary distribution by II to avoid confusion 
with the notation of Section [5j where acyclic orientations of Q are considered and 7r has 
a different meaning. A subset T of the set of nodes of Q is said to be dominating in Q if 
every edge of Q is incident to at least one node in T. 

Corollary 4.1. Let Q be a simple graph on the node set [n] and let Eg = {ei, 62, ■ ■ ■ , e m } be 



the set of edges of Q . The following hold for the chain (4-2) on the set 0(G) of orientations 
ofG: 

(i) The matrix K is diagonalizable with eigenvalues 

(4.3) \ s = Wi > 

i£N s 

one for each S C [m], where Ns is the set of nodes i G [n] which do not belong to 
any of the edges G Eg with k G S. 

(ii) K has a unique stationary distribution II if and only if there is no edge G Eg 
such that Wi = Wj = 0. 

(iii) Assume that IT exists and let K l g be the distribution of the chain started from the 
orientation G 0(G) after I steps. We have 

(4.4) — Iljjxv — P ({vi,v 2 , ■■■ ,vi} is not dominating in G) , 

where (v±,V2, ■ ■ ■) consists of independent and identically distributed picks from w, 
and 



-Wi - w jf 



(4.5) ll^-n|| TV < E ( x 

In particular, we have 

(4.6) ll^o-nlkv < m(l 1 



n 



if wi = ■ ■ ■ = w n = 1/n. 
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Proof. Parts (i) and (ii) follow directly from Theorem 2.1 and the relevant discussion in 



Section 2.2 Part (iii) follows from Theorem 2.1 (iv), since a product of faces of A of the 



form F v is a chamber if and only if the corresponding set of nodes v is dominating in Q 
and since \{e k } = 1 — Wi — Wj is the eigenvalue corresponding to the hyperplane Xk = of 
A associated to the edge ek = of Q. □ 

Example 4.2. Let n = 2d be even and consider the graph Q with edges {l,d + 1}, 
{2, d + 2}, . . . , {d, 2d}. The set of orientations of Q can be identified with { — , +} d , where 
the ith coordinate of a sign vector is equal to + or — if the edge {i, d+i} is directed towards 
i or towards d + i, respectively, in the corresponding orientation. The chain proceeds, at 
each stage, from the current sign vector by picking a coordinate % and switching it to + 
(respectively, — ) with probability Wi (respectively, Wi+d)- Clearly, this chain concides with 



the nearest neighbor random walk of Example |2.2| on the vertex set of the <i-dimensional 
cube, where Wi and Wi+d have the roles played by wf and w~ , respectively, in that example. 

□ 

We postpone the description of the stationary distribution for a general graph Q until 
Section [5] (see Proposition 5.4), where more examples also appear. 



5. Applications to permutation walks 

Throughout this section, A stands for the braid arrangement in W 1 . A subarrangement 
of A is specified by a simple graph Q on the node set [n]. It is first shown that every 
hyperplane walk on A induces a walk on the set of acyclic orientations of Q (Proposition 



5.1). Specializing to the Tsetlin library walk in Part A gives again the walk on acyclic 



orientations of Example 1.2 We give a detailed discussion, determining the eigenvalues, 



stationary distribution and rates of convergence. A birth and extinction example shows 



that the coupling bound (3.8) can be much better than the eigenvalue bound (3.9). Part 
B shows how various aspects of a permutation behave after successive riffle shuffles. This 
yields a probabilistic interpretation for the coefficients of the chromatic polynomial of a 
graph. Descents of permutations of various types are also treated. 

Acyclic Orientations. Let Q be a simple (undirected) graph on the node set [n]. Given 
an orientation o of Q, a directed cycle in o is a sequence of nodes (io, i\, . . . , if.) such that 
io — > i\ — > ■ ■ ■ — > ik — > io in o. An orientation of Q is acyclic if it contains no directed cycles. 
An acyclic orientation o of Q specifies a partial order ^ on the set [n] by letting a ^ b 
if there exists a directed walk in o with initial node a and final node b. Conversely, every 
partial order ^ on [n] comes from a graph on the node set [n] in this way (for instance, 
from the Hasse diagram of ^). 

The acyclic orientations of Q can be modeled by the chambers of a hyperplane arrange- 
ment as follows. The graphical arrangement corresponding to Q is the subarrangement B 
of the braid arrangement A in IR n , consisting of all hyperplanes of the form Xi — Xj = 
for which {i,j} is an edge of Q. A chamber D E C& defines an acyclic orientation of Q 
by assigning the direction j — > i to the edge {i,j} of Q if > Xj holds for every point 
(xi,x 2 , ■ ■ ■ ,x n ) G D. The resulting map is a bijection from the set of chambers Cg to the 
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set of acyclic orientations of Q, henceforth denoted by AO(Q); see |JU Section 2.3] for a 
proof and further information. Using this bijection, we may identify chambers of B with 
the corresponding acyclic orientations of Q. 

It follows from the previous discussion that every hyperplane walk on A induces a Markov 
chain on the set AO(£?), as described in Section[3] We record this conclusion in the following 
proposition. We recall that a permutation r e & n is said to be a linear extention of a 
partial order ^ on [n], if for all a,b G [n] with a -< b we have r~ 1 (a) < t _1 (6), meaning 
that a appears before b in the linear ordering (t(1), t(2), . . . , r(n)) associated to r. 

Proposition 5.1. Every hyperplane walk on the braid arrangement inM. n induces a Markov 
chain on the set AO(Q) of acyclic orientations of Q . If the original walk has a unique 
stationary distribution ir, then the stationary distribution n of the induced chain is given 
by 

(5.1) tt(o) = < T ) 

re£(o) 

for o G AO(Q), where S(o) is the set of linear extensions of the partial order on [n] defined 
by o. In particular, if tc is the uniform distribution on & n , then 

(5.2) »(.) = 
for every o G AO(Q). 



nl 



Proof. Let A denote the braid arrangement in M. n and B denote the graphical arrangement 
corresponding to Q, as before. As already mentioned, the first statement follows from the 



previous discussion and Corollary 3.2 The second statement follows from (3.7) and the 
observation that for chambers C G C4 and D G C& corresponding to the permutation 
r G & n and the acyclic orientation G AO(£?), respectively, we have C C D if and only if 
re 5(o). □ 

Remark 5.2. Acyclic orientations are of importance in various areas of applied mathemat- 
ics, such as computer science, automata theory and statistics. In statistical applications 
they appear as part of the machinery of "Bayes nets" and "casual models", where they 
are used to model casual implication in complex data sets; some useful references are 
[221 12H1 [21] ■ Searching for an appropriate model is often done by a random walk on acyclic 
orientations. We hope that our analysis will contribute to the understanding of these al- 
gorithms. For an introduction to the literature relating acyclic orientations to factoring 
noncommutative polynomials, see [31]. □ 



In the remainder of this section we investigate further the Markov chain of Proposition 
5.1 in the special cases of Examples 2.3 and 2.4 Note that the case of Example 2.3| is also 



treated by the Markov chain (4.2) 



A. Tsetlin Library. Let w\,W2, ■ ■ ■ ,w n be nonnegative real numbers summing to 1 and 



let w be the probability measure on J 7 ^ of Example 2.3 Thus the associated hyperplane 



walk on A is the Markov chain on & n which selects, at each stage, the entry i in the one 
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line notation of the current permutation with probability Wi and moves it in front. To 



describe the induced chain of Proposition 5.1 on the set AO(Q), we observe the following: 



if C G Ca is the chamber which corresponds to a given permutation r G & n and D G Cq is 
the unique chamber of B which contains C, then the acyclic orientation of Q corresponding 
to D is the one which orients an edge {a, b} of Q as b — > a if and only if r~ 1 (a) < r _1 (6). 
It follows that the induced chain on AO(Q) proceeds from a given acyclic orientation by 
selecting the node i of Q with probability w^ and reorienting all edges of Q incident to this 
node towards itself, to reach a new acyclic orientation of Q, leaving the orientations of all 
other edges of Q unchanged. Equivalently, if K* is the transition matrix of the induced 



chain on AO(£?), then A^*(o, o') is given by the right-hand side of (4.2) for o, o' G AO(Q). 

Given a subset T of the node set of Q, we denote by Q \ T the graph obtained from Q 
by removing all nodes in T and all incident to them edges (in other words, Q \ T is the 
induced subgraph of Q on the node set [n] \ T). Thus T is dominating in Q if and only if 
the graph Q \ T has no edges. The following statements can be added to the conclusions 



of Proposition 5.1 



Proposition 5.3. Let Q be a simple graph on the node set [n] and let K* be the transition 
matrix of the Markov chain on AO(Q) which is induced from the Tsetlin library with weights 
w 1 ,w 2 , ...,w n . 

(i) The matrix K* is diagonalizable with characteristic polynomial given by 

(5.3) det(xI-K*) = JJ (x - X s ) ms , 

SC[n] 

where 

(5.4) A 5 = J2 



Wi 



and 



(5.5) m s = (" 1 )' Tx51 #AO(£\T) 

SQTC[n] 

for S C [n], where the number of acyclic orientations of the graph with empty node 
set is equal to one, by convention. 
(ii) K* has a unique stationary distribution ff if and only if there is no edge of Q 

such that Wi = Wj = 0. Moroever, we have 

W T (l)W T (2) ■ ■ ■ W T ( n ) 



no 



E 



7 _,, f0l (1 - W T (i))(l - W t(1) - W t{2 )) ■••(!- W t{1) Wr(n-l)) 



for o G AO(£?), ifwi, w 2 , ■ ■ ■ , w n are all positive. In particular, (5.2) holds for every 
o G AO(£?) if w 1 = ■ ■ ■ = w n = 1/n. 



(hi) Assuming that t\ exists, the conclusions of Corollary 4-1 (Hi) hold if it is replaced 
there by ft and K\ is replaced by the distribution (K*) of the induced chain started 
from the orientation o G AO(C?) after I steps. 
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Proof. We first recall from [HI Section 2.3] the following description of the intersection 
poset Cq of the graphical arrangement B C A corresponding to Q. A set partition n of [n] 
is said to be Q -connected if the induced subgraph of Q on each block of 7r is connected. We 
denote by Cg the set of (/-connected partitions of Q, ordered by refinement. This poset, 
known as the "bond lattice", or "lattice of contractions" of Q, is isomorphic to C&, where 
the isomorphism is induced by the one between the lattice of partitions of [n] and £.4, 
discussed in Section 12.21 

Given W G we write A CT for the eigenvalue Xw of K* which appears in Theorem 



3.3 where a G Cg is the (/-connected partition corresponding to W. The definition of the 
measure w on J 7 ^ of Example 2.3 and the definition of Xw in (2.2) imply that 



(5.6) 



X a 



W: 



where the sum runs over all singleton blocks {i} of a G Cg. Theorem 3.3 (i) gives 

det{xI-K*) = Yl (x-X a ) m °, 

cr&Cg 

where m* = \fig(0, a)\ and fig is the Mobius function of Cg. The previous two equations 



imply that (5.3) holds if we define 



m s 



E 



|j"e(6,e>-) 



crGCg: sing(<r)=S 



where sing(a) = {i G [n] : {i} G cr} denotes the set of singleton blocks of a. To complete 
the proof of part (i), it remains to prove ( |5.5[ ). By inclusion-exclusion we can write 

(5.7) m s = (" 1 )' TX51 



SCTC\n\ 



|^(6, (X) 



where 

<r£Cg: TCsing(cr) 

Clearly, writing a = {Bi, B 2 , . . . , Bk}, the closed interval [6, a] in Cg is isomorphic to the 
direct product of the lattices Cg i for 1 < i < k, where Qi is the induced subgraph of Q 
on the node set Bi. It follows easily from this observation and the multiplicativity of the 
Mobius function [3H1 Proposition 3.8.2] that 

(5.8) n T = I/^xt(M|- 



By Zaslavsky's formula Theorem 2.5] [17], the right-hand side of (5.8) is equal to the 
number of chambers of the graphical arrangement corresponding to Q \ T and hence to 



Part (ii y 

Part (iii) follows from Theorem 3.3 
ordered partitions of the form (z, [n] \ 



the number of acyclic orientations of Q \ T. Thus (5.5) follows from (5.7) and (|5.8|). 
follows from Theorem 3.3 (iii), equation (2.9) and Proposition 5.1 

iv 



since a product of faces of A corresponding to 
{i}) is not contained in any of the hyperplanes of B if 
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and only if the corresponding set of nodes V{ is dominating in Q and since Xh = 1 — W{ — Wj 
holds for the hyperplane H of B corresponding to the edge {i, j} of Q. □ 



Part (ii) of Proposition 5.3 and the following statement determine the stationary distri- 



bution of the Markov chain on the set 0{Q) of all orientations of Q, discussed in part C of 
Section |4} We note that if Q is a forest, then every orientation of Q is acyclic and hence 
the two Markov chains on 0(G) and AO(Q) coincide. 

Proposition 5.4. Consider the chain on the set 0(Q) of all orientations and the chain on 
the set AO(£?) of acyclic orientations of Q, with weights W\, w 2 , ■ ■ ■ , w n . Assuming there is 



no edge {i,j} of Q such that Wi 
7i are related by 



(5.9) 

for o G 0(G). 



Tl(o) 



Wj = 0, their respective stationary distributions II and 

7f (o), if o is acyclic 
0, otherwise 



Proof. We denote by K the transition matrix of the chain on 0(Q) and recall that n(o) = 
lirn^oo K l (o*, o) for o G 0(Q), where the limit is independent of the starting orientation 
o* G 0(G). Choosing o* G AO(£?), all orientations in the chain stay in AO(£?) and the limit 
becomes equal to the right-hand side of (5.9). □ 



Example 5.5. Fix an integer 1 < k < n and let Q be the graph with edges {i,j} for 
1 < i < j < k. The set AO(£?) can be identified with the subgroup &k of permutations in 
& n which fix the set {k + 1, . . . , n} pointwise and the induced chain is the process which 
records the relative ordering of {1, 2, ... , k}, when r G & n evolves as in the Markov chain 
of Example |2.3| The eigenvalues and stationary distribution tt of the transition matrix K* 
can be easily deduced from those of the transition matrix K of the parent chain, since in 
this case K* differs by a multiple of the identity matrix from the restriction of K on &k- 



For instance, K* has eigenvalues (2.8), one for each r G &k- By Proposition 5.3 (iii), the 



bound from (4.5) applies and gives 



In particular, if W\ 



(5.10) 



l<i<j<k 

Wk — 1/fc, so that 7r is uniform, then 



||(^y-7f||TV < 



n 



which is an improvement over (2.11). Thus the left-hand side of (5.10) is bounded above 
by e~ 2c /2 if / > n(lo gfc + c). 

The bound (5.10) is quite sharp across the whole range of k. For instance, if k = 2 



it shows that I must grow as cn, with c approaching infinity. This is correct since if c 
stays bounded, then there is a nonzero chance that neither 1 nor 2 has been moved, and 
thus that they have stayed in their original relative order. At the other extreme, we have 
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already commented in our discussion of (2.11) that the bound is sharp if k = n. Similar 
remarks hold for other values of k. □ 



The following example gives a concrete case in which the bound of (3.8) is better than 



that of (3.9). 



Example 5.6 (Birth and Extinction). Let Q be the graph with edges {i, n} for 1 < i < m, 
where m < n — 1 is a positive integer, and choose weights W\ — • ■ • — w n — 1/n. Every 
orientation of Q is acyclic and hence the set AO{Q) can be identified with { — , +} m , as 
described in part C of Section |4} The chain proceeds, at each stage, from a sign vector 
x G { — , +} m by picking a coordinate i uniformly at random and switching this coordinate 
to +, if 1 < % < m, leaving x unchanged, if m + 1 < i < n, and switching all coordinates 
of x to — , if i — n, to reach a new sign vector. Such processes are studied in mathematical 
genetics with many variations. 

From our current point of view, we may think of this chain as the process which records 
the subset of [m] consisting of those integers which precede n in the current permutation r, 
when r G & n evolves as in the Markov chain of Example 2.3 with uniform weights (random 
to top model). This is because a number i G [m] precedes n in some (equivalently, every) 
linear extension of the orientation o if and only if the edge {i, n} is directed as n — > i in o. 

Proposition 5.7. Consider the Markov chain on AO(Q), which is induced from the Tsetlin 
library with uniform weights, as a chain on the set {— , +} m and let K* be its transition 
matrix. 

(i) The matrix K* is diagonalizable with eigenvalues 



1. 



n 



(5.11) 



m 



with multiplicity one 
with multiplicity (™) ; for 1 < j < m. 

The stationary distribution of K* is given by 

for x G { — , +} m , where k is the number of coordinates of x equal to +. 

Assume m = n — 1 and let (K*) 1 be the distribution of the chain started from x 

after I steps. We have 

i 



(5.12) 



IIM'-^IItv < (i- 



n 



< e 



for I > cn and c > 0. Moreover this bound is sharp, in the sense that there exists 
< 9 < 1 such that \\(K*) n - 7r|| TV > 6 for all large n. 



Proof. Part (i) follows from Corollary 4.1 (i). Alternatively, it follows from the proof 



of Proposition 5.3 (i) and, in particular, equation (5.6), since all values of the Mobius 
function \ig in this case have absolute value 1. Part (ii) follows from Proposition 5.3| (ii) 
and equation (5.2), which applies in our situation, since the number of linear extensions 
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of the poset on [n] defined by any orientation of Q with k edges pointing away from n is 
equal to ( m +J k\(m — k)\{n — m — 1) 



Assuming that m = n—1, (5.12 ) follows from the bound given in (4.4), since a dominating 
set in Q is formed as soon as node n is picked and the chance that this has not happen 
in the first I steps is equal to (1 — l/n) 1 . Finally, suppose that the starting sign vector x 
has all its coordinates equal to — and let A be the set of all y G {— , having at least 

(n — l)/2 coordinates equal to +. An elementary calculation shows that after n steps in 
the chain, the expected number of + coordinates is equal to 



nil A \ e 2 



It follows that (K*) n (A) — > as n — > oo, while clearly it (A) > 1/2. Since the total variation 
distance ||(iT*) n — 7t||tv is bounded below by \(K*) n (A) — 7f (A) | , we conclude that given 
any < 6 < 1/2 we have ||(i^*) n — 7t||tv > $ f° r n large enough. This completes the proof 
of part (iii). A similar argument works for all 2 < m < n — 1. □ 

Example 5.8 (Descent Set). Let Q be the path with edges {i, i + 1} for 1 < i < n — 1 and 
choose weights W\ — ■ ■ ■ — w n — l/n. Once again, every orientation of Q is acyclic and 
hence the set AO(£?) can be identified with the set of sign vectors {— ,+} n ~ 1 . We leave 
it to the reader to give a description of the evolution of this chain on the set { — , +} n_1 



similar to that of Example 5.6 



We find it more convenient to identify AO(^) with the set of subsets of [n — 1], where an 
orientation o of Q is identified with the set of indices i 6 [n— 1] for which the edge {i, i + 1} 
is directed as i — > i + 1 in o. We denote by B the graphical arrangement associated to Q, 
as usual, and recall that there is a directed edge i — > i + 1 in o if and only if x« < Xj+i 
holds in the chamber D of B corresponding to o. In turn, this happens if and only if % + 1 
precedes i in any of the permutations r which correspond to chambers of A contained in 
D or, equivalently, if and only if % belongs to the descent set 

Desfr- 1 ) = {% G [n - 1] : r-\i) > r~ l {i + 1)} 

of the inverse permutation r~ l . Therefore, our chain on the set of subsets of [n — 1] is the 
process which records the descent set Des(r _1 ), when r G & n evolves as in the Markov 



chain of Example 2.3 with uniform weights. We recall that a composition of n is an ordered 



sequence of positive integers (called parts) which sum to n. 

Proposition 5.9. Consider the Markov chain on AO(Q), which is induced from the Tsetlin 
library with uniform weights, as a chain on the set of subsets of [n — 1] and let K* be its 
transition matrix. 

(i) The matrix K* is diagonalizable with eigenvalues j/n for j G {0, 1, . . . , n — 2}U{n}, 
where the multiplicity of j/n is equal to the number of compositions of n having 
exactly j parts equal to 1. 

(ii) The stationary distribution of K* is given by 

(5.13) tx(S) = — # {r G &„ : Des(r) = S} 
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for every S C [n — 1]. 
fiii) We have 



||(^-7f||TV < (n-l)(l-|y, 
where (K s ) is the distribution of the chain started from S after I steps. 



Proof. Part (i) follows once again from Corollary 4.1 
i) . For part 



(i), or from the proof of Proposition 
n — 1] with corresponding orientation 
n] defined by o is in bijection 



5.3| (i). For part (ii) it suffices to note that given S C 
o G AO(Q), the set of linear extensions of the partial order on 
with the set of elements of & n with descent set equal to S, as already discussed before the 

ii). 



is a consequence of (4.6). 



statement of the proposition. Then (|5.13|) follows from (|5.2|) and Proposition 5.3 
(iii 



Part 
□ 



Part (iii) of this proposition implies that for c > 0, the distance || (Kg) — 7t||tv is bounded 
if I > f (log n + c). This can be shown to be sharp, in the sense of Proposition 



above by e 
iii 



5.7 



by an argument similar to the one in the proof of this proposition. 



□ 



Example 5.10 (Cyclic Descent Set). For notational convenience, in this example we re- 
place the node set [n] by the abelian group Z n of integers modulo n. We let Q be the 
cycle with edges {i, i + 1} for 1 < i < n and choose weights ui\ = ■ ■ ■ = w n = 1/n. Since 
there are exactly two orientations of Q which have a directed cycle, the number of acyclic 
orientations of Q is equal to 2 n — 2. We may identify AO(£/) with the set of proper subsets 
of [n] , where an acyclic orientation o of Q corresponds to the set of indices % G [n] for which 



the edge {i, i + 1} is directed as i — > i + 1 in o. Arguing as in Example 5.8 , we see that this 
chain on the set of proper subsets of [n] is the process which records the cyclic descent set 



cDes(r 



{ie[ 



ii 



U > T 



(i + 1)} 



when r G & n evolves as in the Markov chain of Example 2.3 with uniform weights. Cyclic 
descents of permutations were introduced by Cellini [TH] and further studied by Fulman; 
see [23] and references therein. 

The bond lattice Cg is isomorphic to the set of subsets of [n], other than those of 
cardinality n — 1, partially ordered by inclusion. Since this lattice has a well known Mobius 



function, one can deduce easily from (5.6) the following description of the eigenvalues of 



the transition matrix K* of this chain. They are the numbers j/n for j 6 {0,1,..., n} and 
for j > 1, the multiplicity of j/n is equal to the number of set partitions of Z n into blocks 
of the form {a, a + 1, . . . , b} having exactly j singleton blocks. The multiplicity of zero is 
two less than the number of such partitions of Z n having no singleton block. Arguing as 
in Example 5.8, we find that the stationary distribution of K* is given by 

' # {r G G n : cDes(r) = S} 



(5.14) 7f(5) 
for proper subsets S C [n] and that 

(5.15) \\(K*) 1 



itl 



TV 



< nil 



n 
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whe re (K s ) is the distribution of the chain started from S after I steps. As in Example 
it follows that IK-fCj)' — vt||tv is bounded above by e~ c if I > ~(logn + c). □ 



5.8 



B. Inverse a-shufning. Let a > 2 be an integer and let w be the probability measure 



on Ta °f Example 2.4, so that the hyperplane walk associated to w is the Markov chain 



of inverse a-shufHes on & n . Using similar reasoning to the one in the case of the Tsetlin 



library, one can describe the induced chain of Proposition 5.1 on the set AO(Q) as follows: 
The chain proceeds from a given acyclic orientation of Q by selecting uniformly at random 
a weak ordered partition B = (B±, B2, . . . , B a ) of [n] with a blocks. Then the orientation 
of any edge of Q whose endpoints belong to the same block of B is left unchanged and 
any other edge {u, v} of Q is reoriented as u — > v, if i < j holds for the unique indices i 
and j with v G Bi and u G Bj, to reach a new acyclic orientation of Q. We will refer to 
the induced chain as the chain of inverse a-shufHes on AO(£?). Its transition matrix K* 
satisfies 

K (0,0) = 



for 0, o' G AO(^), where u*(o,o') is the number of weak ordered partitions of [n] with a 
blocks, the action of which on 0, just described, results in o'. 

We denote by \Q t ne chromatic polynomial [2J Section 2.3] of Q. Thus for every positive 
integer q, Xg(q) is equal to the number of colorings k : [n] — > [q] of the nodes of Q with q 
colors satisfying k(u) 7^ k(v) for every edge {u,v} of Q. Part (i) of the following corollary 
provides an interpretation to the coefficients of xg which strengthens a theorem of Stanley 
[37] [ill Corollary 2.3], stating that the sum of the unsigned coefficients of \g is equal to the 
number of acyclic orientations of Q. There are other interpretations to these coefficients; 
see, for instance, (HJ Theorem 4.12] and [231 14*4"] . 

Proposition 5.11. Let Q be a simple graph on the node set [n] and let K* be the transition 
matrix of the Markov chain of inverse a-shufftes on AO(Q). 

(i) The matrix K* is diagonalizable with characteristic polynomial given by 

n—l 1 

det(xI-K*) = Y[ (x--) p \ 
i=o a 

where 



n-l 



Xg(q) = Yl (-^P^ 



i=0 

is the chromatic polynomial of Q. 



(ii) The stationary distribution it of K* is given by (5.2). 
(hi) We have 



(5.16) ||(A7y-7r|| T v < m 



where m is the number of edges of Q and (K*) 1 is the distribution of the chain 
started from the acyclic orientation o, after I steps. 
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Proof. Let B denote the graphical arrangement corresponding to Q, as usual, and /x# denote 
the Mobius function of the intersection poset £g. It follows from Theorem 3.3 (i) and (2.12) 
that the distinct eigenvalues of K* are 1, 1/a, 1/a 2 , . . . , l/a n ~ l and that the multiplicity pi 
of the eigenvalue 1/a 1 satisfies 



Pi 



WeC B : codim(W,F)= 



f -iy^ B (v,w). 



Equivalently, (— is equal to the coefficient of q n ~ % in the characteristic polynomial [311 
Section 2.3] (HI Section 1.3] of B, which is known to equal Xq{q) PU Theorem 2.7]. This 
proves part (i). Part (ii) follows from Proposition 5.1 
converges to the uniform distribution on & n . Part (iii 
for every H G B. 



since the chain of inverse a-shuffies 
follows from (3.9), since Xh — 1/a 

□ 



For the applications discussed in the following example, we think of & n as the set of 
linear orderings of a deck of n cards, labeled by the elements of [n]. Since inverse a- 
shuffling, followed by passing to the inverse of the current permutation, gives the same 
distribution as ordinary a-shuffiing, there is a straightforward translation of our results 
into the language of a-shuffles. 



Example 5.12. (i) Suppose that Q is the star of Example 5.6, say with n — 1 edges. The 



chain of inverse a-shuffies on AO(Q) is the process which records the set S C [n — 1] of the 
labels of cards which precede card n in the current linear ordering, in the chain of inverse 
a-shuffies on & n (this is stronger than just recording the current position of card n; see [3] 
for a summary of results on that Markov chain). 

Since Q is a tree, its chromatic polynomial is given by Xoil) = q{q ~~ l) n_1 and hence, 
by Proposition 5.11 (i), the matrix K* has eigenvalues 1/a* with multiplicity ( n T ), fo r 
< i < n - 1. 
where m 



n 



The stationary distribution n(S) is given by the right-hand side of (5.11) 
1 and k is the number of elements of S. Moreover, (5.16) gives 



(5.17) 



IK* 



ttIItv < {n - 1) 



and hence the total variation distance on the left is bounded above by a~ c if I > log a n + c, 
for c > 0. This shows a speedup over the (3/2) log a n + c, required for the parent chain of 
inverse a-shuffles on & n to reach stationarity, and is essentially sharp by the results of [3, 
Section 2]. 



(ii) Suppose that Q is the path of Example 5.8 The induced chain records the descent 
set Des(r _1 ) of the inverse of the current permutation r in the chain of inverse a-shuffles 
on & n . The stationary distribution is given by (5.13). Since the path Q is also a tree, the 



description of the eigenvalues for the star example and (5.17) continue to hold. The result 



on the rate of convergence in this case was obtained earlier in (TTJ Section 3], where it is 
also shown that (l/2)log a n + c steps are necessary and sufficient for Des(r _1 ) to reach 
stationarity if n is large. 

(iii) Suppose that Q is the cycle of Example 5.10 The induced chain now records the 
cyclic descent set cDes(r~ 1 ) of the inverse of the current permutation r in the chain of 
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inverse a-shuffles on & n . It follows, in particular, that this process is a Markov chain 
on the set of proper subsets of [n]. As in the previous case, we find that the stationary 



distribution is given by (5.14) and that 



||(iry-7f|| TV < n(±) 



so that IK-K"*)' — vr|| TV is bounded above by a c if I > log a n + c, for c > 0. We leave further 
details to the interested reader. □ 

Remark 5.13. The difference between ordinary and inverse a-shuffles is easy to appreciate 
by considering the graph with a single edge {1,2}. Then the induced process records the 
relative order of cards labeled 1 and 2. After fewer than log a n ordinary a-shuffles, there is 
a good chance (close to 1) that they are still in their original order. However, their relative 
order is close to random after a growing number of inverse a-shuffles. □ 

6. Semigroup walks 

The theory of hyperplane walks was generalized to random walks on semigroups by 
Brown [IIJ[I2]- This section shows how our main results can be extended in this direction. 
Some familiarity with the ideas of [TTIITS] will be assumed. The algebraic aspects of Brown's 
theory of semigroup walks have been further studied in [U H2J 03], with probabilistic 
developments in [19]. These references contain examples to which the following theory 
may be applied. 

The face semigroup of a hyperplane arrangement A is defined as the set F^ of faces of A, 
endowed with the product operation discussed in Section [2] The set C4 of chambers is a left 
ideal of F^, meaning that it is a nonempty subset which is closed under left multiplication 
by elements of Fj, (of course, C4 is a right ideal as well). Given a finite semigroup S, a 
left ideal C of S and a probability measure w on S, one can define a Markov chain on the 
state space C with transition matrix K given by 

(6.1) K(c,c') = Yl w ^ 

xSiS: xc=c' 

for c, d G C. We refer to this chain as the semigroup walk on C associated to w; it coincides 
with the hyperplane walk on A associated to w, if S = Fa and C = C4. The semigroup S 
is said to be a band if x 2 = x for every x 6 S. To such a semigroup, one can associate a 
join semilattice L and a surjective map supp : S — > L, satisfying 

(6.2) supp(x) < L supp(y) y = yxy 

for x,y G S; see [IS Section A. 2] for further details. The support map has the additional 
property that 

(6.3) supp(xy) = supp(a;) V supp(y) 

for x,y G S, where u V v denotes the least upper bound (join) of u and v in L. In the 
special case of a face semigroup J 7 ^, the support of a face F G Tj± is the linear span of 
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F and the semilattice L is the dual of the intersection poset Cj±, defined in Section [2j A 
band S is called left-regular if xyx = xy for all x, y G S. 

Assume that S is a finite band. Then L is a fi nite join semilattice and hence it has a 



maximum element, denoted 1. It follows from (6.3) that the set Cs = {c G S : supp(c) = 1} 
is a left ideal of S. The elements of Cs are called chambers. Given x G S, the subsemigroup 
xS is a finite band whose number of chambers depends only on the support u = supp(a;) 
of x in L; see [121 Section B.3]. We denote this number by n{u). The following theorem 
was proved for left-regular bands in |TT] and generalized to all bands in [ 1 2J . 



Theorem 6.1. Let S be a finite band with corresponding semilattice L and set of chambers 
Cs- Let w be a probability measure on S and let K be the transition matrix of the semigroup 
walk on Cs associated to w. 

(i) The characteristic polynomial of K is given by 
(6.4) det(xI-K) = Y[(x-X u ) m \ 



where 

(6.5) \ u = ^2 w ( x ) 

x£S: s\xpp(x)<LU 

is an eigenvalue, 

(6.6) m u = 22 HL{u,v)n(v), 



U< J,V 



Hi, is the Mobius function of L and n(v) is the number of chambers of xS for any 
x G S with supp(x) = v. 
(ii) The matrix K is diagonalizable. 

(hi) If the set {x G S : w(x) > 0} generates S, then K has a unique stationary distri- 
bution it and 

(6.7) II^c-^IItv < P{x 1 x 2 ---x l ^C s } < J2 A «' 

u 

where K l c is the distribution of the chain started from c G Cs after I steps, (xi,x%, . . .) 
consists of independent and identically distributed picks from the measure w on S 
and u runs through the set of elements of L covered by 1. 

The results of Section [3] can be extended to this setting as follows. Let (p : — > be 
an epimorphism of finite semigroups, meaning that <p is a surjective map which satisfies 
<j)(xy) = <f)(x)<f>(y) for all x, y G 1S4. Given a left ideal C of 1S4 and a probability measure 



w on 1S4, the semigroup walk (6.1) on C associated to w induces a stochastic process on 



the state space 0(C), in the sense of Section 2.3 Since is surjective, the image 0(C) is a 



left ideal of This setup generalizes that of the map / : J 7 ^ — > of face semigroups 
of Section |3j where B is a subarrangement of a hyperplane arrangement A and f(F) is 
the unique face of B which contains F, for F G T_\. The following proposition generalizes 
Corollary 3.2 



30 



CHRISTOS A. ATHANASIADIS AND PERSI DIACONIS 



Proposition 6.2. Let : Sa — > Sb be an epimorphism of semigroups, C C Sa be a left 
ideal and w be a probability measure on Sa- For every starting distribution on C, the 
stochastic process on 0(C) which is induced from the semigroup walk on C associated to w 
by the map is Markov. Moreover, this induced chain is itself a semigroup walk on 0(C), 
with associated probability measure w* on S& defined by 



(6.8) 



UT (Z) 



x£Sa- <t>( x )- 



W(X) 



Proof. This follows by computations similar to those in the proofs of Proposition |3.1| and 
Corollary □ 



Suppose now that 



: Sa — > Sb is an epimorphism of finite bands. The definition of the 

induces an order preserving, 



support semilattice in [12], Section A. 2] and (6.2) imply that 
surjective map 0* : La — > Lb of the associated semilattices which makes the diagram 



S 



A 



Sr- 



supp^ 



suppg 



^A - 

commute, where supp_4 and supp B are the support maps of Sa and Sb, respectively. We 
denote by C4 and Cg the set of chambers of Sa and Sb, respectively. Since C4 is a left 
ideal of Sa, the image 4>{Ca) is a left ideal of Sb- 

Lemma 6.3. Let Sa and Sq be finite bands with sets of chambers Ca and Cg, respectively, 
and let : Sa — > Sq be an epimorphism of semigroups. 

(i) We have <p{C A ) Q C B - 

(ii) If Sa is left-regular, then 4>{Ca) = Cg. 

Proof. To prove (i), suppose that c 6 Ca- Then we have supp^x) < supp_4(c) in L^ for 
every x e Sa- By (6.2), this means that c = cxc holds in Sa for every x G Sa- Since 

(c)^0(c) for every z G Sb 
He) G C B - 



By 



is an epimorphism of semigroups, it follows that 0(c) = 
reversing the first part of the argument, we conclude that 

Assume now that Sa is left-regular. It was shown in (TTJ Sections 2.2 and B.3] that the 
relation defined by letting x ^a y xy = y for x,y <E Sa, is a partial order on Sa 
and that the chambers of Sa are precisely the maximal elements of -<a- Similar remarks 
hold for the band Sb, which is also left-regular as a homomorphic image of Sa- To prove 
(ii), suppose that d G Cg and let x G Sa be such that (f)(x) = d. Then there exists c G Ca 
such that x ^a c - Clearly, the map : Sa — > Sb is order preserving and hence d ^b 0( c )- 
Since d G Cg is maximal in -<b, we must have d = 0(c). This shows that d G 0(C_4) and 
hence that Cg C 0(C^). In view of part (i), it follows that 0(C^) = Cg. □ 



Lemma (^3 implies that if : Sa — > Sb is an epimorphism of finite left-regular bands 
and C = Ca, then the induced Markov chain of Proposition 6.2 is a semigroup walk on 



the state space Cg of chambers of Sb- Thus all conclusions of Theorem 6.1 apply to the 
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induced chain. We leave it to the reader to formulate the exact analogue of Theorem 3.3 
in this situation and end with a remark on the rest of the material of Section |3l 



Remark 6.4. The proofs of parts (i) and (iv) of Theorem 2.1, given in Section |3j ext end 



easily in the setup of Theorem 6.1 
the equality 



For part (i), for instance, one should replace (3.13) by 



(6.9) 



#{c eC s :xc = c}= 

u£L: supp(x)<£U 

for x G S, where the m u are defined by (6.6). A computation similar to that given in 
the proof of Theorem 2.1 (i) in Section [3| equation (6.3) and a slight variant of Lemma 



3.4 then imply that the m u are necessarily nonnegative integers and that (6.4) holds. To 



check the validity of (6.9), we observe that the set {c E Cs ■ xc = c} is equal to the set of 
chambers of xS (see, for instance, [121 Example A. 13]). Thus the left-hand side of (6.9) 
equals n(v), where v = supp(a;) G L, and hence ( |6.9[ ) is equivalent to 



n\v) 



for v G L. This is in turn equivalent to (6.6) by Mobius inversion on L. 



details of the coupling proof of (6.7) to the interested reader 



We leave the 
□ 
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